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(54) PIGMENT PROTEIN 

(57) An object of the present invention is to provide 
a novel chromoprotein derived from Cnidopus japoni- 
cus. The present invention provides a chromoprotein 
derived from Cnidopus japonicus having the following 
properties: 



(2) the molar absorption coefficient is 66,700 at 610 
nm; and 

(3) the pH sensitivity of light-absorbing property is 
stable at between pH 4 and pH 10. 



(1 ) the absorption maximum wavelength is 610 nm, 
and fluorescence is not emitted; 
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Description 

TECHNICAL FIELD 

5 [0001] The present invention relates to a novel chromoprotein. More specifically, the present Invention relates to a 
novel chromoprotein derived from Cnidopus japonicus, and the use thereof. 

BACKGROUND ART 

10 [0002] Green fluorescent protein (GFP) derived from Aequorea victoria, a jellyfish, has many purposes in biological 
systems. Recently, various GFP mutants have been produced based on the random mutagenesis and semi-rational 
mutagenesis, wherein a color is changed, a folding property is improved, luminance is enhanced, or pH sensitivity is 
modified. Fluorescent proteins such as GFP are fused with other proteins by gene recombinant technique, and mon- 
itoring of the expression and transportation of the fusion proteins is carried out. 

15 [0003] One of the most commonly used types of GFP mutant is Yellow fluorescent protein (YFP). Among Aequorea- 
derived GFP mutants, YFP exhibits the fluorescence with the longest wavelength. The values e and G> of the majority 
of YEPs are 60,000 to 100,000 M _1 cnr 1 and 0.6 to 0.8, respectively (Tsien, R. Y. (1998). Ann. Rev. Biochem. 67, 
509-544). These values are comparable to those of the general fluorescent group (fluorescein, rhodamine, etc.). Ac- 
cordingly, improvement of the absolute luminance of YFP is nearly approaching its limit. 

20 [0004] In addition, cyan fluorescent protein (CFP) is another example of the GFP mutant. Of this type of protein, 
ECFP (enhanced cyan fluorescent protein) has been known. Moreover, red fluorescent protein (RFP) has been isolated 
from sea anemone (Discoma sp.). Of this type of protein, DasRed has been known. Thus, 4 types of fluorescent 
proteins, that are, green fluorescent protein, yellow fluorescent protein, cyan fluorescent protein, and red fluorescent 
protein, have successively been developed. The range of the spectrum has significantly been expanded. 

25 

DISCLOSURE OF THE INVENTION 

[0005] An object of the present invention is to provide a novel chromoprotein derived from Cnidopus japonicus. 
[0006] The present inventors have conducted intensive studies directed towards achieving the aforementioned ob- 

30 ject. They have designed suitable primers based on information regarding the amino acid sequences of known fluo- 
rescent proteins. Using these primers, they have succeeded in the amplification and cloning of a gene encoding a 
novel chromoprotein from the cDN A library of Cnidopus japonicus exhibiting a green color. The present inventors have 
f urther analyzed the light-absorbing properties and pH sensitivity of the obtained chromoprotein derived from Cnidopus 
japonicus. The present invention has been completed based oh these findings. 

35 [0007] That is to say, the present invention provides a chromoprotein derived from Cnidopus japonicus having the 
following properties: 

(1) the absorption maximum wavelength is 610 nm, and fluorescence is not emitted; 

(2) the molar absorption coefficient is 66,700 at 610 nm; and 

40 (3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 1 0. 

[0008] In another aspect, the present invention provides a chromoprotein having either one of the following amino 
acid sequences: 

45 (a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids with 
respect to the amino acid sequence shown in SEQ ID NO:. 1, and having light-absorbing properties. 

[0009] In another aspect, the present invention provides a chromoprotein capable of emitting fluorescence, which 
50 has an amino acid sequence, wherein, with respect to the amino acid sequence shown in SEQ ID NO: 1, alanine as. 
an amino acid residue at position 28 is substituted by glycine, glutamic acid as an amino acid residue at position 41 is 
substituted by methionine, cysteine as an amino acid residue at position 145 is substituted by serine, and threonine 
as an amino acid residue at position 1 58 is substituted by isoleucine. 
[0010] In another aspect, the present invention provides: 

55 

a chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
substituted by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1 ; 

a chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
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substituted by methionine with respect to the amino acid sequence shown in SEQ ID NO: 1; 
a chromoprotein having an amino acid sequence, wherein glutamic acid as an amino acid residue at position 41 
is substituted by leucine, and phenylalanine as an amino acid residue at position 80 is substituted by glycine, with 
respect to the amino acid sequence shown in SEQ ID NO: 1 ; 
5 a chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 

amino acid residue at position 64 is substituted by phenylalanine with respect to the amino acid sequence shown 
in SEQ ID NO: 1; . 

a chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
amino acid residue at position 64 is substituted by histidine with respect to the amino acid sequence shown in 
10 SEQ ID NO: 1 ; and 

a chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein cysteine as an 
amino acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 143 is 
substituted by serine, and proline as an amino acid residue at position 199 is substituted by leucine, with respect 
to the amino acid sequence shown in SEQ ID NO: 1 . 

15 

[0011] In another aspect, the present invention provides DNA encoding the protein of the present invention. 
[0012] In another aspect, the present invention provides either one of the following DNAs: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 
20 (b) DNA encoding an amino acid sequence, which comprises a deletion,. substitution and/or addition of one or 

several amino acids with respect to the amino acid sequence shown in SEQ ID NO: 1 , and has light-absorbing 
properties. 

[001 3] In another aspect, the present invention provides DNA having either one of the following nucleotide sequenc- 
es es: 

(a) the nucleotide sequence shown in SEQ ID NO: 2; and 

(b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides with 
respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing prop- 

.30 . erties. 

[0014] In another aspect, the present invention provides DNA having the nucleotide sequence shown in any one of 
SEQ ID NOS: 12, 14, 16, 18, 20, or 22. 

[0015] In another aspect, the present invention provides a recombinant vector having the DNA of the present inven- 
ts tion. • 

[0016] In another aspect, the present invention provides a transformant having the DNA or recombinant vector of 
the present invention. 

[0017] In another aspect, the present invention provides a fusion protein composed of the chromoprotein of the 
present invention and another protein. 
40 [0018] In another aspect, the present invention provides a method for analyzing a physiologically active substance, 
which is characterized in that the FRET (fluorescence resonance energy transfer) method is applied using the chromo- 
protein of the present invention as an acceptor protein. 

[001 9] In another aspect, the present invention provides a light-absorbing reagent kit comprising the chromoprotein, 
DNA, recombinant vector, transformant, or fusion protein of the present invention. 



45 



50 



BRIEF DESCRIPTION OF THE DRAWINGS 
[0020] 



Figure 1 shows the results obtained by measuring the absorption spectrum of the chromoprotein (KGr) derived 
from Cnidopus japonicus oi the present invention. The transverse axis indicates the wavelength of absorbed light, 
and the vertical axis indicates absorbance. 

Figure 2 shows the pH sensitivity of the absorption spectrum of the chromoprotein (KGr) derived from Cnidopus 
japonicus of the present invention. The transverse axis indicates pH value, and the vertical axis indicates absorb- 
55 ance. 61 0 nm indicates the absorbance that is specific to the chromoprotein (KGr) derived from Cnidopus japonicus 

of the present invention, and 277 nm indicates the absorbance (light absorption by aromatic amino acids) that is 
generally used as a quantitative amount of protein. In other words, it is shown that the amount of protein is constant 
at 277 nm, and the absorbance at 610 nm that is specific to the chromoprotein (KGr) derived from Cnidopus 
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japonicus of the present invention hardly changes in the range between pH 4 and pH 1 0. 
Figure 3 shows the fluorescence spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
. of the present invention (a mutant obtained by substituting A at position 28 by G, substituting E at position 41 by 
M, substituting C at position 1 45 by S, and substituting T at position 158 by I in the amino acid sequence of KGr). 
5 The transverse axis indicates wavelength, and the vertical axis indicates fluorescence intensity. The term "em" 

indicates a fluorescence spectrum, and the term "ex" indicates an excitation spectrum. 

Figure 4 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Y at position 64 by L in the amino acid sequence of KGr) . 
Figure 5 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
10 of the present invention (a mutant obtained by substituting Y at position 64 by M in the amino acid sequence of KGr). 

Figure 6 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Eat position 41 by L and substituting F at position 80 
by G in the amino acid sequence of KGr). 

Figure 7 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
15 Cnidopus japonicus of the present invention (a mutant obtained by substituting Y at position 64 by F in the amino 

acid sequence of KGr). 

Figure 8 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Y at position 64 by F in the amino acid sequence of KGr). 
Figure 9 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
20 Cnidopus japonicus of the present invention (a mutant obtained by substituting Y at position 64 by H in the amino 

acid sequence of KGr). 

Figure 1 0 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting Y at position 64 by H in the amino acid sequence of KGr). 
Figure 11 shows the fluorescence and excitation spectrums of a mutant of the chromoprotein (KGr) derived from 
25 Cnidopus japonicus of the present invention (a mutant obtained by substituting C at position 26 by V, substituting 

C at position 143 by S, and substituting P at position 199 by L in the amino acid sequence of KGr). 
Figure 1 2 shows the absorption spectrum of a mutant of the chromoprotein (KGr) derived from Cnidopus japonicus 
of the present invention (a mutant obtained by substituting C at position 26 by V, substituting C at position 143 by 
S, and substituting P at position 199 by.L in the amino acid sequence of KGr). 

30 

BEST MODE FOR CARRYING OUT THE INVENTION 

[0021] The embodiments of the present invention will be described in detail below. 

35 (1) Chromoprotein of the present invention 

[0022] The chromoprotein of the present invention is characterized in that it is derived from Cnidopus japonicus, and 
has the following properties: 

40 (1) the absorption maximum wavelength is 610 nm, and fluorescence is not emitted; 

(2) the molar absorption coefficient is 66,700 at 610 nm; and 

(3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 1 0. 

[0023] Cnidopus japonicus is one type of sea anemone belonging to Anthozoa of Cnidaria. Among the types of sea 
45 anemone that can be seen in Japan, Cnidopus japonicus has the highest degree of color mutation. Its withers height 
is always low, and it has a large number of warts on the body wall thereof. It has approximately 200 short tentacles. A 
parent sea anemone discharges developed embryos from its oral part. The discharged embryos become attached to 
the body wall of the parent sea anemone. Thereafter, the embryos are further developed, and as a result, they become 
baby sea anemone. Thus, this sea anemone was named Komochi Isoginchaku (a Japanese name meaning "seed sea 
50 anemone"). This type of sea anemone is distributed in the intertidal zones on the rock coasts between Hokkaido and 
Boso Peninsula and also in the zones immediately below the intertidal zones. 

[0024] It is to be noted that a chromoprotein having the aforementioned properties was isolated using Cnidopus 
japonicus as a starting material in the examples described later, but that the chromoprotein of the present invention 
may also be obtained from forms of sea anemone other than Cnidopus japonicus in some cases. Such a chromoprotein 
55 is also included in the scope of the present invention. 

[0025] As described in the examples below, the chromoprotein of the present invention has an absorption maximum 
wavelength of 610 nm and does not emit fluorescence. In addition, the present chromoprotein has a molar absorption 
coefficient of 66,700 at 610 nm. The molar absorption coefficient represents the amount of absorbed photons per mole 
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of molecule. Since the chromoprotein of the present invention does not emit fluorescence, the chromoprotein of the 
present invention can be used: (1 ) as an acceptor molecule (energy receptor) in FRET; (2) in development of a system 
for converting the energy of applied light into energy other than the light; and (3) in introduction of a mutation into the 
amino acid sequence of the protein to modify it so that it emits fluorescence. 

[0026] In addition, the chromoprotein of the present invention is characterized in that the pH sensitivity of light- 
absorbing properties is stable at between pH 4 and pH 1 0. That is to say, in the case of the chromoprotein of the present 
invention, the peak value of the absorption spectrum does not significantly fluctuate in the range between pH 4 and 
pH 1 0. Accordingly, even under the same conditions, the chromoprotein of the present invention can be used in a broad 
range of pH environments, and thus, the use of the chromoprotein in vivo has few restrictions. 

[0027] The examples of the chromoprotein of the present invention include a chromoprotein having either one of the 
following amino acid sequences: 

(a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids with 
respect to the amino acid sequence shown in SEQ ID NO: 1 . and having light-absorbing properties. 

[0028] The scope of "one or several" in the phrase "an amino acid sequence comprising a deletion, substitution and/ 
or addition of one or several amino acids" is not particularly limited in the present specification. For example, it means 
1 to 20, preferably 1 to 10, more preferably 1 to 7, further preferably 1 to 5, and particularly preferably 1 to 3. 
[0029] The term "light-absorbing properties" is used in the present specification to mean properties capable of ab- 
sorbing light having a certain wavelength. For example, an absorption maximum wavelength may be 610 nm as in the 
case of the chromoprotein described in the present specification, or the value of the absorption maximum wavelength 
may also be shifted. It is preferable that the pH sensitivity of light-absorbing properties is stable at between pH 4 and 
pH10. 

[0030] The chromoprotein of the present invention having the amino acid sequence shown in SEQ ID NO:. 1 in the 
sequence listing does not emit fluorescence. In the present invention, one or several amino acids are deleted, substi- 
tuted, and/or added with respect to the amino acid sequence shown in SEQ ID NO: 1 , so as to produce a protein having 
modified light-absorbing properties, or so as to produce a protein emitting fluorescence in some cases. The thus pro- 
duced proteins are also included in the scope of the present invention. 

[0031] A specific example of a fluorescent protein produced by such mutation of amino acids may be a fluorescent 
protein having an amino acid sequence, wherein, with respect to the amino acid sequence shown in SEQ ID NO: 1, 
alanine as an amino acid residue at position 28 is substituted by glycine, glutamic acid as an amino acid residue at 
position 41 is substituted by methionine, cysteine as an amino acid residue at position 145 is substituted by serine, 
and threonine as an amino acid residue at position 158 is substituted by isoleucine. 

[0032] Other specific examples of a fluorescent protein produced by such mutation of amino acids may include: a 
chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is substituted 
by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1; a chromoprotein having an amino acid 
sequence Wherein tyrosine as an amino acid residue at position 64 is substituted by methionine with respect to the 
amino acid sequence shown in SEQ ID NO: 1; a chromoprotein having an amino acid sequence, wherein glutamic 
acid as an amino acid residue at position 4^ is substituted by leucine, and phenylalanine as an amino acid residue at 
position 80 is substituted by glycine, with respect to the amino acid sequence shown in SEQ ID NO: 1 ; a chromoprotein 
capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine* as an amino acid residue at 
position 64 is substituted by phenylalanine with respect to the amino acid sequence shown in SEQ ID NO: 1 ; a chromo- 
protein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an amino acid residue 
at position 64 is substituted by histidine with respect to the amino acid sequence shown in SEQ ID NO: 1; and a 
chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein cysteine as an amino 
acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 143 is substituted by 
. serine, and proline as an amino acid residue at position 199 is substituted by leucine, with respect to the amino acid 
sequence shown in SEQ ID NO: 1. 

[0033] The method of obtaining the chromoprotein of the present invention is not particularly limited. The protein 
may be either a protein synthesized by chemosynthesis, or recombinant protein produced by a gene recombination 
technique. 

[0034] Where a recombinant protein is produced, it is necessary to obtain DNA encoding the protein. Appropriate 
primers are designed by using information regarding the amino acid sequence shown in SEQ ID NO: 1 of the sequence 
listing of the present specification and the nucleotide sequence shown in SEQ ID NO: 2 thereof. Using these primers, 
PGR is carried out by using cDNA library derived from Cnidopus japonicus as a template, so that DNA encoding the 
chromoprotein of the present invention can be obtained. The chromoprotein of the present invention can be produced 
by introducing this DNA into an appropriate expression system. Expression in an expression system will be described 
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later in the present specification. 

(2) DNA of the present invention 

5 [0035] According to the present invention, a gene encoding the chromoprotein of the present invention is provided. 
[0036] Specific examples of DNA encoding the chromoprotein of the present invention may include either one of the 
following DNAs: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 
10 (b) DNA encoding an amino acid sequence, which comprises a deletion, substitution and/or addition of one or 

' several amino acids with respect to the amino acid sequence shown in SEQ ID NO: 1, and has light-absorbing 
properties. 

[0037] Other examples of DNA encoding the chromoprotein of the present invention may include either one of the 
15 following DNAs: 

(a) the nucleotide sequence shown in SEQ ID NO: 2; and 

(b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides with 
respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing prop- 

20 erties. 



[0038] Further, examples of the DNA having the nucleotide sequence comprising a deletion, substitution and/or 
addition of one or several nucleotides with respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding 
a protein having light-absorbing properties, may include DNA having the nucleotide sequence shown in any one of 
25 SEQIDNOS: 12, 14, 16, 18, 20, or22. 

[0039] The DNA of the present invention can be synthesized by, for example, the phosphoamidite method, or it can 
also be produced by polymerase chain reaction (PCR) using specific primers. The DNA of the present invention is 
produced by the method described above in the specification. 

[0040] A method of introducing a desired mutation into a certain nucleic acid sequence is known to a person skilled 
30 jn the art. For example, known techniques such as a site-directed mutagenesis, PCR using degenerated oligonucle- 
otides, or the exposure of cells containing nucleic acid to mutagens or radioactive rays, are appropriately used, so as 
to construct DNA having a mutation. Such known techniques are described in, for example, Molecular Cloning: A 
Laboratory Manual, 2 nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY., 1989; and Current Protocols in 
Molecular Biology, Supplements 1 to 38, John Wiley & Sons (1 987-1 997). 
35 ■ v 

, (3) Recombinant vector of the present invention 

[0041] The DNA of the present invention can be inserted into a suitable vector and used. The type of a vector used 
in the present invention is not particularly limited. For example, it may be either a vector that can autonomously replicate 
40 (e.g., a plasmid, etc.), or vector that is incorporated into the genomes of host cells when it is introduced into the host 
cells and is then replicated together with the chromosome into which it is incorporated. 

[0042] The vector used in the present invention is preferably an expression vector. In an expression vector; elements 
necessary for transcription (e.g., a promoter, etc.) are functionally ligated to the DNA of the present invention. The 
promoter is a DNA sequence which shows a transcriptional activity in host ceils, and it is appropriately selected de- 
45 pending on the type of host cells. 

[0043] Examples of a promoter which can operate in bacterial cells may include a Bacillus stearothermophilus mal- 
togenic amylase gene promoter, a Bacillus Hcheniformis atpha-amylase gene promoter, a Bacillus amyloliquefaciens 
BAN amylase gene promoter, a Bacillus subtilis alkaline protease gene promoter, a Bacillus pumilus xylosidase gene 
promoter, Pp, and P L promoters of phage rhamda, and lac, tip and tac promoters of 

50 

Escherichia coli 

[0044] Examples of a promoter which can operate in mammalian cells may include an SV40 promoter, an MT-1 
(metallothionein gene) promoter, and an adenovirus-2 major late promoter. Examples of a promoter which can operate 
55 in insect cells may include a polyhedrin promoter, a P 1 0 promoter, an Autographa californica polyhidrosis basic protein 
promoter, a baculovirus immediate-early gene 1 promoter, and a baculovirus 39K delayed-early gene promoter. Ex- 
amples of a promoter which can be operate in yeast host cells may include promoters derived from yeast glycolytic 
genes, an alcohol dehydrogenase gene promoter a TPM promoter, and an ADH2-4c promoter. 
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[0045] Examples of a promoter which can operate in filamentous cells may include an ADH3 promoter and a tpiA 
promoter. 

[0046] In addition, an appropriate terminator such as a human growth hormone terminator, or a TPI1 terminator or 
ADH3 terminator for fungal cells, may be functionally bound to the DNA of the present invention, as necessary. The 
5 recombinant vector of the present invention may further have elements such as a polyadenylation signal (e.g., one 
derived from SV40 or the adenovirus 5E1b region), a transcription enhancer sequence (e.g., an SV40 enhancer), or 
a translation enhancer sequence (e.g., one encoding the adenovirus VA RNA). 

[0047] The recombinant vector of the present invention may further comprise a DNA sequence which enables the 
replication of the recombinant vector in host cells. SV40 replication origin is an example of such a sequence (when 

w the host cells are mammalian cells). 

[0048] The recombinant vector of the present invention may further comprise a selective marker. Examples of such 
a selective marker may include genes, complements of which are absent from host cells, such as a dihydrofolate 
reductase (DHFR) gene or a Shizosaccharomyces pombe TPI gene, and drug resistant genes such as ampicillin, 
kanamycin, tetracycline, chloramphenicol, neomycin or hygromycin-resistant genes. 

15 [0049] A method for ligating the DNA of the present invention, a promoter and, as desired, a terminator and/or a 
secretory signal sequence to one another and inserting these items into a suitable vector is known to a person skilled 
in the art. 



(4) Transformant of the present invention 

20 

[0050] A transformant can be produced by introducing the DNA or recombinant vector of the present invention into 
a suitable host. 

[0051] Any cell can be used as a host cell into which the DNA or recombinant vector of the present invention is 
introduced, as long as the DNA construct of the present invention can be expressed therein. Examples of such a cell 
25 may include bacteria, yeasts, fungal cells, and higher eukaryotic cells. 

[0052] Examples of bacteria may include Gram-positive bacteria such as Bacillus or Streptomyces, and Gram-neg- 
ative bacteria such as Escherichia coli. These bacteria may be transformed by the protoplast method or other known 
methods, using competent cells. 

[0053] Examples of mammalian cells may include HEK 293 cells, HeLa cells, COS cells, BHK cells, CHL cells, and 
30 CHO cells. A method of. transforming mammalian cells and expressing the introduced DNA sequence in the cells is 
also known. Examples of such a method may include the electroporation, the calcium phosphate method, and the 
lippfection method. 

[0054] Examples of yeast cells may include those belonging to Saccharomyces or Shizosaccharomyces. Examples 
of such cells may include Saccharomyces cerevisiae and Saccharomyces ktuyveri. Examples of a method of introducing 
35 a recombinant vector into yeast host cells may include the electroporation, the spheroplast method, and the lithium 
acetate method. 

[0055] Examples of other fungal cells may include those belonging to Filamentous fungi such as Aspergillus, Neu- 
rpspora, Fusarium or Trichoderma. Where Filamentous fungi are used as host cells, transformation can be carried out 
by incorporating DNA constructs into host chromosomes, so as to obtain recombinant host cells. Incorporation of DNA 
40 constructs into the host chromosomes is carried out by known methods, and such known methods may include ho- 
mologous recombination and heterologous recombination. 

[0056] Where insect cells are used as host cells, both a vector into which a recombinant gene is introduced and a 
baculovirus are co-introduced into insect cells, and a recombinant virus is obtained in the culture supernatant of the 
insect cells. Thereafter, insect cells are infected with the recombinant virus, so as to allow the cells to express proteins 
45 (described in, for example, Baculovirus Expression Vectors, A Laboratory Manual; and Current Protocols in Molecular 
Biology, Bio/Technology, 6, 47 (1988)). 

[0057] The Autographs californica nuclear polyhedrosis virus, which is a virus infecting to insects belonging to Bar- 
athra brassicae, can be used as baculovirus. 

[0058] Examples of insect cells used herein may include Sf9 and Sf21 , which are Spodoptera frugiperda ovarian 
so cells [Baculovirus Expression Vectors, A Laboratory Manual, W. H. Freeman & Company, New York, (1 992)], and HiFive - 
(manufactured by Invitrogen), which are Trichoplusia ni ovarian cells. 

[0059] Examples of the method of co-introducing both a vector into which a recombinant gene has been introduced 
and the above baculovirus into insect cells to prepare a recombinant virus may include the calcium phosphate method 
and the I ipofection method. 

55 [0060] The above transformant is cultured in an appropriate nutritive medium under conditions enabling the intro- 
duced DNA construct to be expressed. In order to isolate and purify the protein of the present invention from the culture 
product of the transformant, common methods of isolating and purifying proteins may be used. 
[0061] For example, where the protein of the present invention is expressed in a state dissolved in cells, after com- 
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pletion of the culture, cells are recovered by centrifugal separation, and the recovered cells are suspended in a water 
type buffer. Thereafter, the cells are disintegrated using an ultrasonic disintegrator or the like, so as to obtain a cell- 
free extract. A supernatant is obtained by centrifuging the cell-free extract, and then, a purified sample can be obtained 
from the supernatant by applying, singly or in combination, the following ordinary protein isolation and purification 

5 methods: the solvent extraction, the salting-out method using ammonium sulfate or the like, the desalting method, the 
precipitation method using an organic solvent, the anion exchange chromatography using resins such as diethylami- 
noethyl (DEAE) sepharose, the cation exchange chromatography using resins such as S-Sepharose FF (manufactured 
by Pharmacia), the hydrophobic chromatography using resins such as butyl sepharose or phenyl sepharose, the gel 
filtration method using a molecular sieve, the affinity chromatography, the chromatofocusing method, and the electro- 

10 phoresis such as isoelectric focusing. 

(5) Use of the chromoprotein of the present invention and a fusion protein comprising the same 

[0062] The chromoprotein of the present invention can be fused with another protein, so as to construct a fusion 
15 protein. The type of said another protein to be fused to the chromoprotein of the present invention is not particularly 
limited ; and preferred examples may include a protein which interacts with another molecule. The examples may include 
a receptor protein or ligand thereof, antigen, antibody and the like. 

[0063] A method of obtaining the fusion protein of the present invention is not particularly limited. It may be either a 
protein synthesized by chemosynthesis, or recombinant protein produced by a gene recombination technique. 

20 [0064] Where a recombinant fusion protein is produced, it is necessary to obtain DNA encoding the protein. The 
DNA encoding the chromoprotein of the present invention and the DNA encoding the another protein to be fused to 
. the chromoprotein, can be obtained by the method as mentioned above in this specification or by the method similar 
to it. Then, these DNA fragments are ligatedto one another by a gene recombination technique, so that DNA encoding 
the desired fusion protein can be obtained. This DNA is then introduced into an appropriate expression system, so that 

25 the fusion protein of the present invention can be produced. 

[0065] FRET (fluorescence resonance energy transfer) has been known as a means for analyzing the interaction 
. between molecules. In FRET, for example, a first molecule labeled with a cyan fluorescent protein (CFP) acting as a 
first fluorescent protein is allowed to coexist with a second molecule labeled with a yellow fluorescent protein (YFP) 
acting as a second fluorescent protein, so as to allow the yellow fluorescent protein (YFP) to act as an acceptor molecule 

30 and to allow the cyan fluorescent protein (CFP) to act as a donor molecule. Thus, FRET (fluorescence resonance 
energy transfer) is allowed to take place between both molecules, so as to visualize the interaction between the first 
and second molecules. Namely, in FRET, different dyes are introduced into two types of molecules. One dyes with a 
higher energy level (a donor molecule) is selectively excited, and the fluorescence of the dye is measured. Long- 
wavelength fluorescence from the other dye (an acceptor molecule) is also measured. The interaction between the 

35 molecules is visualized by using the difference between the amounts of both fluorescences. Only when both dyes are 
adjacent to each other due to the interaction of the two types of molecules, a decrease in the fluorescence of the donor 
molecule and an increase in the fluorescence of the acceptor molecule are observed by single wavelength excitation 
dual wavelength photometry. However, in a case where a chromoprotein is used as an acceptor molecule, a decrease 
in the fluorescence of the donor molecule occurs only when both dyes are adjacent to eac^h other by the interaction of 

to the two types of molecules. Such a decrease can be observed by single wavelength excitation single wavelength 
photometry. Thus, the use of the chromoprotein of the present invention enables facilitation of measurement appara- 
tuses. 

[0066] The chromoprotein of the present invention is particularly advantageous when it is used as an acceptor mol- 
ecule in FRET (fluorescence resonance energy transfer). That is to say, a fused form (a first fused form) of the chromo- 

45 protein of the present invention and a test substance is first produced. Then, a fused form (a second fused form) of 
another test substance interacting with the above test substance and another fluorescent protein is produced. There- 
after, the first fused form is allowed to interact with the second fused form, and the generated fluorescence is analyzed, 
so that the interaction between the aforementioned two types of test substances can be analyzed. FRET (fluorescence 
resonance energy transfer) using the chromoprotein of the present invention may be carried out either in a test tube 

50 or in a cell. 

(6) Kit of the present invention 

[0067] The present invention provides a light-absorbing reagent kit comprising at least one which is selected from 
55 the chromoprotein, fusion protein, DNA, recombinant vector or transformant, which are described in the present spec- 
ification. The kit of the present invention can be produced from commonly used materials that are known per se, by 
using common methods. 

[0068] Reagents such as the chromoprotein or the DNA are dissolved in an appropriate solvent, so that the reagents 
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can be prepared in a form suitable for conservation. Water, ethanol, various types of buffer solution, etc. can be used 
as such a solvent. . 

[0069] The present invention will be further described in the following examples. However, the present invention is 
not limited by these examples. 

EXAMPLES 

Example 1 : Isolation of gene encoding novel chromoprotein from sea anemone 

(1) Extraction of total RNA 

[0070 j A chromoprotein gene was isolated from sea anemone emitting a green color. Cnidopus japonicus emitting 
a green color was used as a material. Frozen Cnidopus japonicus was crushed in a mortar. 7.5 ml of "TRlzol" (GIBCO 
BRL) was added to 1 g (wet weight) of Cnidopus japonicus, and the mixture was homogenized, followed by centrifu- 
gation at 1 ,500 x g for 1 0 minutes. 1 .5 ml of chloroform was added to the supernatant. The mixture was stirred for 15 
seconds and then left at rest for 3 minutes. The resultant product was centrifuged at 7,500 x g for 15 minutes. 3.75 ml 
of isopropanol was added to the supernatant. The mixture was stirred for 1 5 seconds and then left at rest for 1 0 minutes. 
The resultant product was centrifuged at 17,000 x g for 10 minutes. The supernatant was discarded, and 6 ml of 70% 
ethanol was added thereto. The obtained mixture was centrifuged at 1 7,000 x g for 1 0 minutes. The supernatant was 
discarded, and the precipitate was dissolved in 200 uJ of DEPC water. Total RNA dissolved in the DEPC water was 
. 100 times diluted, and the values of O.D.260 and O.D.280 were measured, so as to determine the concentration of 
RNA. 1 .2 mg of the total RNA was obtained from a green individual. 

(2) Synthesis of first stand cDNA 

[0071 ] cDN A (33 jil) was synthesized from 4 \ig of the total RNA using a kit for synthesizing first strand cDN A, " Ready 
To Go" (Amersham Pharmacia). 

(3) Degenerated PGR 

[0072] Using 3 uJ out of the synthesized first strand cDNA (33 uJ) as a template, PCR was carried out. Primers were 
designed and produced by comparing the amino acid sequences of known fluorescent proteins, extracting similar 
portions, and converting them into nucleotide sequences. The sequences of the used primers are shown below: 



35 



40 



5'-GGIGSICCmTISCITT-3' (primer 1) (SEQ ID NO: 3); 



and 



45 



5'-AACTGGAAGAATTCGCGGCCGCAGAA'lTl , l"l , l l, ri"l , l , l'rri"ri , 'rr-3' (primer 2) 
(SEQ ID NO: 4), 

wherein I represents inosine, S represents C or G, and H represents A, T, or C. 



50 



55 



Composition of PCR reaction solution 


Template (first strand cDNA) 


3u,l 


X 10 taq buffer 


5uJ 


2.5 mMdNTPs . 


4 uJ 


100 uM primer 3 


1 jil 


1 00 uM primer 4 


1^. 


MilliQ 


35 uJ 


Taq polymerase (5 U/uJ) 


1uJ 
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RCR reaction conditions 

94°C, 1 minute (PAD) 

94°C, 30 seconds (denaturation) 

52°C t 30 seconds (annealing of the primers to the template) 
s 72°C, 1 minute (elongation of the primers) 

[0073] 30 cycles consisting of the above 3 steps were carried out. The annealing temperature was decreased 0.3°C 
per cycle. That is to say, the annealing temperature in the 30 th cycle was 43°C. 
72°C, 7 minutes (final elongation) 
10 Retention at 4°C 

[0074] Using 1 u.l of an amplified product obtained as a result of the first PCR reaction as a template, PCR was 
carried out once again under the same conditions. A 800-bp fragment (derived from the green individual) was cut out 
by agarose gel electrophoresis and then purified. This 800-bp fragment contained a 3MJTR portion as a whole. 

15 (4) Subcloning and sequencing 

[0075] The purified DNA fragment was ligated to a pT7-blue vector (Novagen). Escherichia coli (TG1) was trans- 
formed with the vector, and the obtained transformants were subjected to blue white selection. Thereafter, plasmid 
DNA was purified from white colonies of Escherichia coli. The nucleotide sequence of the inserted DNA fragment was 
20 determined by a DNA sequencer. The obtained nucleotide sequence was compared with the nucleotide sequences of 
other fluorescent protein genes to confirm that the nucleotide sequence of the DNA was derived from a fluorescent 
protein. 5'-RACE and 3'-RACE methods were applied to a gene that had been confirmed to be a part of a fluorescent 
protein gene, so as to carry out the cloning of a full-length gene. 

25 (5) 5'-RACE method 



[0076] In order to determine the nucleotide sequence of the S'-terminal side of the DNA fragment obtained by de- 
generated PCR, the 5'-RACE method was applied using 5'-RACE System for Rapid Amplification of cDNA Ends, Ver- 
sion 2.0 (GIBCO BRL). 3 u.g of the total RNA prepared in (1) above was used as a template. 
30 [0077] For the first amplification of DC-tailed cDNA of the green individual, the following primers were used: 



5'-GGCCACGCGTCGACTAGTACGGGHGGGIIGGGHG-3' (primer 3) (SEQ ED NO: 

as 5); 

and 



40 . 5'-AGACGAGGCAATTTCCATCAAG-3' (primer 4) (SEQ ID NO: 6), 

wherein I represents inosine. 

[0078] For the second amplification, the following primers were used: 

45 

5'-GGCCACGCGTCGACTAGTAC-3' (primer 5) (SEQ ID NO: 7); 



and 



so . 

5'-GGCTACGCTTCCATATTGGCAGTT-3' (primer 6) (SEQ ED NO: 8). 

PCR reaction conditions and the like were determined in accordance with the protocols attached with the kit. 
[0079] The 350-bp amplified band was cut out by agarose gel electrophoresis and then purified. The purified DNA 
55 fragment was ligated to a pT7-blue vector (Novagen). Escherichia coli (TG1 ) was transformed with the vector, and the 
obtained transformants were subjected to blue white selection. Thereafter, plasmid DNA was purified from white col- 
onies of Escherichia coli. The nucleotide sequence of the inserted DNA fragment was determined by a DNA sequencer. 
The entire nucleotide sequence is shown in SEQ ID NO: 2, and the entire amino acid sequence is shown in SEQ ID 
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NO: 1. 



Example 2: Expression of protein in Escherichia coii 

[0080] Primers corresponding to the N- and C-termini of the protein were produced from the obtained full-length 
nucleotide sequence. PCR was carried out using the primers and the first strand cDNA prepared in (2) above as a 
template. The used primers are as foliows: 



10 



5'-CGGGATCCGACCATGGCTTCCAAAATCAGC-3' (primer 7) (SEQ ID NO: 9); 



and 



15 



5'-CCGGAATTCTTAATTGTGACCAAGTTTAGATGGGCA-3' (primer 8) (SEQ ID 
NO: 10). 



20 



25 
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Composition of PCR reaction solution 


Template (first strand cDNA) . 


3fil 


X 1 0 pyrobest buffer 


5ul 


2.5 mM dNTPs 


4pJ 


1 00 fiM primer 7 


1 uJ 


100 u,M primer 8 


Ifil 


Milli Q 


35 uJ 


Pyrobest polymerase (5 U/uJ) 





RCR reaction conditions 

94°C, 1 minute (PAD) 

94 0 C, 30 seconds (denatu ration) 

55°C, 30 seconds (annealing of the primers to the template) 
72°^ 1 minute (elongation of the primers) • 

[0081] 30 cycles consisting of the above 3 steps were carried out. 
72°C, 7 minutes (final elongation) 
Retention at 4°C 

[0082] An amplified band of approximately 700 bp was cut out by agarose gel electrophoresis and then purified. The 
purified DNA fragment was subcloned into the BamHI-EcoRI site of a pRSET vector (Invitrogen), and it was then 
allowed to express in Escherichia coii (JM109-DE3). Since the expressed protein was constructed such that His-tag 
was attached to the N-terminus thereof, it was purified with Ni-Agarose gel (Ql AG EN). Purification was carried out in 
accordance with the attached protocols. 

Example 3: Analysis of protein 

(1 ) Analysis of light-absorbing properties 

[0083] The light-absorbing properties of the protein expressed in Example 2 were analyzed. 
[0084] An absorption spectrum was measured using a 50 mM HEPES solution (pH 7.5) containing a 20 u.M chromo- 
protein. A molar absorption coefficient was calculated from the peak value of this spectrum. In the case of the chromo- 
protein derived from the green individual (referred to as KGr), the absorption peak was observed at 610 nm, and no 
fluorescence was detected (Table 1 , Figure 1). 
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Table 1 



Properties of chron 


loprotein (KGr) 


Absorption 
maximum 


Fluorescence 
maximum 


Molar absorption 
coefficient 


quantum yield 


pH sensitivity 


Number of amino 
acids 


610 nm 




66,700 (610nm) 




Non 


232 



10 



15 



(1) Measurement of pH sensitivity 

The pH sensitivity of the protein expressed in Example 2 was analyzed. 

The absorption spectrum of the protein was measured in the following 100 mM buffer solution (Figure 2). 
The following buffer solutions were used for each pH: 

pH 4 and 5: Acetate buffer 
pH 6: MES buffer 
pH 7 and 8: HEPES buffer 
pH 9 and 10: Glycine buffer 



20 



25 
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The peak value did not significantly change at any pH. 
Example 4: Modification of KGr 

[0085] In KGr, A at position 28 was substituted by G, E at position 41 was substituted by M, C at position 145 was 
substituted by S, and T at position 158 was substituted by I, so that the KGr was modified to have an absorption peak 
at 444 nm and to emit yellow fluorescence with a peak at 534 nm (Figure 3). 

Example 5: Modification of properties of KGr by. amino acid substitution 

[0086] Y at position 64 that is a chromophore-forming amino acid (QYG) of KGr was substituted by L or M, so that 
the absorption peak became 418 nm and so that the absorption peak was shifted from the original absorption peak at 
61 0 nm to the shorter wavelength side (Figures 1 and 2). The amino acid sequence of the protein wherein Y at position 
64 was substituted by L is shown in SEQ ID NO: 11, and the nucleotide sequence thereof is shown in SEQ ID NO: 12. 
The amino acid sequence of the protein wherein Y at position 64 was substituted by M is shown in SEQ ID NO: 13, 
and the nucleotide sequence thereof is shown in SEQ ID NO: 14. 

[0087] E at position 41 was substituted by L, and F at position 80 was substituted by G t so that the absorption peak 
become 528 nm and so that the absorption peak was shifted from the original absorption peak at 61 0 nm to the shorter 
wavelength side (Figure 3). The amino acid sequence of this protein is shown in SEQ ID NO: 15, and the nucleotide 
sequence thereof is shown in SEQ ID NO: 16. 

[0088] Y at position 64 that is a chromophore-forming amino acid (QYG) was substituted by F, so that the absorption 
peak became 41 2 nm, and so that the absorption peak was shifted from the original absorption peak at 61 0 nm to the 
shorter wavelength side and the protein was further modified to emit fluorescence with a peak at 504 nm (Figures 4 
and 5). The amino acid sequence of this protein is shown in SEQ ID NO: 17, and the nucleotide sequence thereof is 
shown in SEQ ID NO: 18. 

[0089] Y at position 64 that is a chromophore-forming amino acid (QYG) was substituted by H, so that the absorption 
peak became 418 nm, and so that the absorption peak was shifted from the original absorption peak at 610 nm to the 
shorter wavelength side and the protein was further modified to emit fluorescence with a peak at 520 nm (Figures 6 
and 7). The amino acid sequence of this protein is shown in SEQ ID NO: 19, and the nucleotide sequence thereof is 
shown in SEQ ID NO: 20. 

[0090] C at position 26 was substituted by V, C at position 143 was substituted by S, and P at position 199 was 
substituted by L, so that the absorption peak became 585 nm, and so that the absorption peak was shifted from the~ 
original absorption peak at 610 nm to the shorter wavelength side and the protein was further modified to emit fluo- 
rescence with a peak at 625 nm. This fluorescent protein was defined as KGr Rb (Figures 8 and 9). The amino acid 
sequence of this protein is shown in SEQ ID NO: 21 , and the nucleotide sequence thereof is shown in SEQ ID NO: 22. 

INDUSTRIAL APPLICABILITY 

[0091] The present invention provides a novel chromoprotein derived from Cnidopus japonicus. The chromoprotein 
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of the present invention has desired fluorescence properties and low pH sensitivity. Thus, it is useful for molecular 
biology analysis. 
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SEQUENCE LISTING 
<110> RIKEN 
<120> Chromo protein 
<130> A31347A 
<160> 22 

<210> 1 
<211> 232 
<212> PRT 

<213> Cnidopus japbnicus 
<400> 1 

Met Ala Ser Lys He Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

.35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Tyr 

50 55 60 

Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 . 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 
. 115 120 125 
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Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 
130 135 140 

5 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 ' 150 155 160 

10 Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Yal Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220* 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 

30 

<210> 2 
3 5 <211> 699 

<212> DNA 

<213> Cnidopus japonicus 

40 

<400> 2 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
45 Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
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Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40' 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 
Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Tyr 

50 55 60 

gga age gta gec ata ace aag tat aca tea ggg att cca gac tac ttt 240 
Gly Ser Val Ala. He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga ace aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn He Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

gag atg cgc taUaca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

-165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 '185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 

.195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gec agg tac tgt caa aca 672 
15 Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 

20 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 

25 

<210> 3 
<211> 17 

30 

<212> DNA 

<213> Artificial Sequence 
35 <220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 3 

ggigsiccih tiscitt 17 



40 



<210> 4 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence- Synthetic DNA 
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<400> 4 

aactggaaga attcgcggcc gcagaatttt tttttttttt tttt 44 

<210> 5 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<400> 5 

ggccacgcgt cgactagtac gggiigggii gggiig 36 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 6 

agacgaggca atttccatca ag 22 



45 <210> 7 

<211> 20 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
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<400> 7 

ggccacgcgt cgactagtac 20 

<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 8 

ggctacgctt ccatattggc agtt 24 

<210> 9 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 9 

cgggatccga ccatggcttc caaaatcagc 30 

<210> 10 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence". Synthetic DNA 
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<400> 10 

ccggaattct taattgtgac caagtttaga tgggca 36 

<210> 11 
<211> 232 
<212> PRT 
, 5 <213> Cnidopus japonicus 

<400> 11 

Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 
25 20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 
35 40 45 

30 - 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Leu 
50 55 60 

35 Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 

65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 144 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
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145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 * 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 



<210> 12 
<211> 699 
<212> DNA 

<213> Cnidopus japonicus 
35 <400> 12 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

40 

1 5 ' 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
45 Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 • 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa ctt 192 
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Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Leu 

50 55 60 

gga age gta gec ata acc aag tat aca tea ggg att cca gac tac ttt 240 
Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 
100 105 110 

25 aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 

Asn Cys Leu Val Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 
115 120 125 

30 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Yal Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 
35 130 135 144 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 

40 

145 150 155 16Q 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
45 Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 



50 



55 
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His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Clu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gec agg tac tgt caa aca 672 
Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 21.5 220 

tgc cca tct aaa ctt ggt cae aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 13 - 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<400> 13 

Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 1° 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 

65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 
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100 105 110 

Asn Cys Leu Val Tyr Asn He Lys ile Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 
180 185 190 

25 His Phe Ser Asp His Arg Pro Glu Ile Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

30 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
35 225 230 



10 



15 



20 



40 



45 



<210> 14 
<211> 699 
<212> DNA 

<2i3> Cnidopus japonicus 
<40O> 14 

so atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 

Met Ala Ser Lys Ile Ser Asp Asn Val Arg Ile Lys Leu Tyr Met Glu 

1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 



55 
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Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Clu. Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa atg 192 
Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

gga age gta gec ata acc aag tat aca tea ggg att cca gac tac ttt 240 
Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr. Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn lie Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 
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Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gec agg tac tgt caa aca 672 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 



<210> 15 
35 <211> 232 

<212> PRT 

<213> Cnidopus japonicus 

40 

<400> 15 - 

Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 
45 i 5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Leu Asn lie Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 
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50 

Gly Ser Val Ala 

5 

65 

Lys Gin Ser Phe 

w 

Glu Asp Gly Ala 
100 

Asn Cys Leu Val 
115 

20 

Asn Gly Pro Val 
130 

25 Glu Met Arg Tyr 

.145 

Ala Leu Lys Cys 

30 

Thr Tyr Arg Ser 
35 180 
His Phe Ser Asp 
195 

40 

Thr Leu Phe Glu 
210 

45 Cys Pro Ser Lys 

225 

50 

<210> 16 
<211> 699 
55 <212> DNA 



EP 1 512 744 A1 

55 60 
lie Thr Lys Tyr thr Ser Gly lie Pro Asp Tyr Gly 
70 75 80 

Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 
85 90 95 

Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

105 110 
Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

120 125 
Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

135 140 
Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
150 155 160 

Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 
165 170 175 

Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

185 .190 
His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 

200 205 
Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

215 220 
Leu Gly His Asn 
230 
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<213> Cnidopus japonicus 
<400> 16 

5 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg ctg 48 
Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

1 5 . 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
, 5 Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg ctt aac ata aaa gtc ace aaa gga 144 

20 

Lys Pro Tyr Glu Gly Thr Gin Met Leu Asn lie Lys Val Thr Lys Gly 
35 40 45 

25 ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 

Gly Pro Leu Pro Phe Ser Phe Asp He Leu Thr Pro Asn Cys Gin Met 
50 55 60 

30 

gga age gta gee ata acc aag tat aca tea ggg att cca gac tac ggt 240 
Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Gly 
35 . 65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

40 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
45 Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

_ aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 

50 

Asn Cys Leu Val Tyr Asn He Lys lie Leu Gly Cys Asn Phe Pro Pro 
115 120 125 

55 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
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15 



20 



25 



30 



35 



Asn Gly Pro Val 
130 

gag atg cgc tat 
Glu Met Arg Tyr 
145 

gca ctt aaa tgc 
Ala Leu Lys Cys 



act tac 
Thr Tyr 

cat ttt 
His Phe 

aca eta 
Thr Leu 
210 
tgc cca 
Cys Pro 
225 



agg tec 
Arg Ser 
180 
tea gac 
Ser Asp 
195 

ttt gaa 
Phe Glu 

tct aaa 
Ser Lys 



Met Gin Lys Lys Thr Gin 
135 

aca cgt gat ggt gtg eta 
Thr Arg Asp Gly Val Leu 

150 
gec gat 
Ala Asp 
165 

aaa aag 
Lys Lys 



cat cgt 
His Arg 

cag cac 
Gin His 

ctt ggt 
Leu Gly 
230 



ggg aac 
Gly Asn 

gca gca 
Ala Ala 

cct gaa 
Pro Glu 
200 
gaa agt 
Glu Ser 
215 

cac aat 
His Asn 



cac etc 
His Leu 
170 
sag gcg 
Lys Ala 
185 

ata gtg 
lie Val 

tea gtg 
Ser Val 

taa 



Gly Trp Glu Pro Cys Cys 

140 
tgt ggc 
Cys Gly 
155 

act tgc 
Thr Cys 



caa aca tta 
Gin Thr Leu 



ttg cag 
Leu Gin 

aag gtt 
Lys Val 

gec agg 
Ala Arg 
220 



cat ctg aga 
His Leu Arg 
175 

atg cca ccc 
Met Pro Pro 

190 
tea gag aac 
Ser Glu Asn 
205 

tac tgt caa 
Tyr Cys Gin 



atg 480 
Met 
160 

act 528 
Thr 



ttc 576 
Phe 

ggc 624 
Gly 

aca 672 
Thr 

699 



<210> 17 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<400> 17 

Met Ala Ser Lys lie Ser Asp Asn Val Arg. He Lys Leu Tyr Met Glu 
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5 10 15 

Cly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met- Glu Asn lie Lys Val Thr Lys Gly 

35 40 • 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Phe 

50 55 60 

Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Yal Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 
210 215 220 
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Cys Pro Sei Lys Leu Gly His Asn 
225 230 

<210> 18 
<211> 699 
<212> DNA 

<213> Cnidopus japonicus 
<400> 18 

atg get tec aaa ate age gae aat gta cgt ate aag tta tat atg gag 48 

Met Ala Ser Lys lie Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu - 

5 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

.20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa .ttt 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Phe 

50 55 60 

gga age gta gee ata acc aag tat aca tea ggg att cca gac tac ttt 240^ 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 s 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
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Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100. 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa ace caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 
130 135 140 

20 . gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gee gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 
195 200 205 

45 aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 
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<210> 19 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<400> 19 

Met Ala Ser Lys lie Ser Asp Asn Val Arg. lie Lys Leu Tyr Met Glu 

1.5 10 15 

Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 
20 25 30 

. Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn lie Lys Val Thr Lys Gly 
35 40 45 

25 Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin His 

50 55 60 

Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly He Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn lie Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 
130 135 140 

so Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 

145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 
165 170 175 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp His Arg Pro Glu He Val Lys Yal Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 
210 215 220 

is Cys Pro Ser Lys Leu Gly His Asn 

225 230 

20 

<210> 20 
<211> 699 
25 <212> DNA 

<213> Cnidopus japonicus 
<400> 20 

30 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys lie Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 
35 1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg tgc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Cys Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Yal Thr Lys Gly 

35 40 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa cat 192 
Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin His 

50 55 60 

gga age gta gec ata acc aag tat aca tea ggg att cca gac tac ttt 240 
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55 
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Gly Ser Val Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

aag caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 
Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr lie Tyr 

85 90 95 

gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 
Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 
Asn Cys Leu Val Tyr Asn He Lys lie Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc tgt tgc 432 
Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Cys Cys 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 
Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

gca ctt aaa tgc gee gat ggg aac cac etc act tgc cat ctg aga act 528 
Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

cat ttt tea gac cat cgt cct gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Pro Glu lie Val Lys Val Ser Glu Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 



35 



1512744A1_L> 



EP 1 512 744 A1 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca tct aaa ctt ggt cac aat taa 699 
Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 21 
<211> 232 
<212> PRT 

<213> Cnidopus japonicus 
<400> 21 

Met Ala Ser Lys He Ser Asp Asn Val Arg lie Lys Leu Tyr Met Glu 

15 10 15 

Gly Thr Val Asn Asn His His Phe Met Val Glu Ala Glu Gly Glu Gly 

20 25 30 

Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40 45 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

Gly Ser Val Ala He Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 
65 70 75 80 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

Asn Cys Leu Val Tyr Asn lie Lys lie Leu Gly Cys Asn Phe Pro Pro 
115 120 125 
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Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Ser Cys 

130 135 140 

Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 
145 150 155 160 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 

His Phe Ser Asp Kis Arg Leu Glu lie Val Lys Val Ser Glu Asn Gly 

195 200 205 

Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

Cys Pro Ser Lys Leu Gly His Asn 
225 230 

<210> 22 
<2ll> 699 
<212> DNA 

<213> Cnidopus japonicus 
<400> 22 

atg get tec aaa ate age gac aat gta cgt ate aag tta tat atg gag 48 
Met Ala Ser Lys He Ser Asp Asn Val Arg He Lys Leu Tyr Met Glu 

1 5 10 15 

ggc aca gtc aac aat cat cac ttc atg gtc gaa get gaa gga gag ggc 96 
Gly Thr Val Asn Asn His His Phe Met Val Glu Ala Glu Gly Glu Gly 

20 25 30 

aag cca tac gag gga act caa atg gag aac ata aaa gtc acc aaa gga 144 
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Lys Pro Tyr Glu Gly Thr Gin Met Glu Asn He Lys Val Thr Lys Gly 

35 40" 45 

ggc cct ctg ccg ttc tct ttt gat ate ttg acg cct aac tgc caa tat 192 

Gly Pro Leu Pro Phe Ser Phe Asp lie Leu Thr Pro Asn Cys Gin Met 

50 55 60 

gga age gta gee ata acc aag tat aca tea ggg att cca gac tac ttt 240 

Gly Ser Yal Ala lie Thr Lys Tyr Thr Ser Gly lie Pro Asp Tyr Phe 

65 70 75 80 

aag. caa tct ttt cct gaa gga ttt acc tgg gaa aga acc aca ate tac 288 

Lys Gin Ser Phe Pro Glu Gly Phe Thr Trp Glu Arg Thr Thr He Tyr 

85 90 95 

25 gaa gat ggg get tac ctt aca act caa caa gaa acc aaa ctt gat gga 336 

Glu Asp Gly Ala Tyr Leu Thr Thr Gin Gin Glu Thr Lys Leu Asp Gly 

100 105 110 

aat tgc etc gtc tac aat att aaa ate ctt gga tgt aat ttt ccc ccc 384 

Asn Cys Leu Val Tyr Asn He Lys He Leu Gly Cys Asn Phe Pro Pro 

115 120 125 

aat ggt.cct gtg atg cag aag aaa acc caa ggc tgg gaa ccc agt tgc 432 

Asn Gly Pro Val Met Gin Lys Lys Thr Gin Gly Trp Glu Pro Ser Cys 

130 135 140 

gag atg cgc tat aca cgt gat ggt gtg eta tgt ggc caa aca tta atg 480 

45 Glu Met Arg Tyr Thr Arg Asp Gly Val Leu Cys Gly Gin Thr Leu Met 

145 150 155 160 

gca ctt aaa tgc gec gat ggg aac cac etc act tgc cat ctg aga act 528 

50 

Ala Leu Lys Cys Ala Asp Gly Asn His Leu Thr Cys His Leu Arg Thr 

165 170 175 

55 act tac agg tec aaa aag gca gca aag gcg ttg cag atg cca ccc ttc 576 
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Thr Tyr Arg Ser Lys Lys Ala Ala Lys Ala Leu Gin Met Pro Pro Phe 

180 185 190 . 

cat ttt tea gac cat cgt ctt gaa ata gtg aag gtt tea gag aac ggc 624 
His Phe Ser Asp His Arg Leu Glu lie Val Lys Val Ser Glu'Asn Gly 

195 200 205 

aca eta ttt gaa cag cac gaa agt tea gtg gee agg tac tgt caa aca 672 
15 Thr Leu Phe Glu Gin His Glu Ser Ser Val Ala Arg Tyr Cys Gin Thr 

210 215 220 

tgc cca. tct aaa ctt ggt cac aat taa .699 
Cys Pro Ser Lys Leu Gly His Asn 
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Claims 

30 1. a chromoprotein derived from Cnidopus japonicus having the following properties: 

(1) the absorption maximum wavelength is 61 0 nm, and fluorescence is not emitted; 

(2) the molar absorption coefficient is 66,700 at 610 nm; and 

(3) the pH sensitivity of light-absorbing property is stable at between pH 4 and pH 1 0. 



2. A chromoprotein having either one of the following amino acid sequences: 



(a) the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) an amino acid sequence comprising a deletion, substitution and/or addition of one or several amino acids 
*o with respect to the amino acid sequence shown in SEQ ID NO: 1 , and having light-absorbing properties. 

3. A chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein, with respect to 
the amino acid sequence shown in SEQ ID NO: 1 , alanine as an amino acid residue at position 28 is substituted 
by glycine, glutamic acid as an amino acid residue at position 41 is substituted by methionine, cysteine as an 

45 amino acid residue at position 145 is substituted by serine, and threonine as an amino acid residue at position 158 

is substituted by isoleucine. 

4. A chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
substituted by leucine with respect to the amino acid sequence shown in SEQ ID NO: 1 . 

50 

5. A chromoprotein having an amino acid sequence wherein tyrosine as an amino acid residue at position 64 is 
substituted by methionine with respect to the amino acid sequence shown in SEQ ID NO: 1 . 

6. A chromoprotein having an amino acid sequence, wherein glutamic acid as an amino acid residue at position 41 
55. is substituted by leucine, and phenylalanine as an amino acid residue at position 80 is substituted by glycine, with 

respect to the amino acid sequence shown in SEQ ID NO: 1 . 

7. A chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
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amino acid residue at position 64 is substituted by phenylalanine with respect to the amino acid sequence shown 
in SEQ ID NO: 1 . 

A chromoprotein capable of emitting fluorescence, which has an amino acid sequence wherein tyrosine as an 
amino acid residue at position 64 is substituted by histidine with respect to the amino acid sequence shown in 
SEQ ID NO: 1. 

A chromoprotein capable of emitting fluorescence, which has an amino acid sequence, wherein cysteine as an 
amino acid residue at position 26 is substituted by valine, cysteine as an amino acid residue at position 143 is 
substituted by serine, and proline as an amino acid residue at position 199 is substituted by leucine, with respect 
to the amino acid sequence shown in SEQ ID NO: 1 . 

10. A DNA encoding the protein of any of claims 1 to 9. 

15. 11. A DNA of either one of fallowings: 

(a) DNA encoding the amino acid sequence shown in SEQ ID NO: 1 ; and 

(b) DNA encoding an amino acid sequence, which comprises a deletion, substitution and/or addition of one 
or several amino acids with respect to the amino acid sequence shown in SEQ ID NO: 1 , and has light-ab- 

20 so rbing properties. 

12. A DNA having either one of the following nucleotide sequences: 

(a) the nucleotide sequence shown in SEQ ID NO: 2; and 
25 (b) a nucleotide sequence comprising a deletion, substitution and/or addition of one or several nucleotides 

with respect to the nucleotide sequence shown in SEQ ID NO: 2, and encoding a protein having light-absorbing 
properties. 

13. A DNA having the nucleotide sequence shown in any one of SEQ ID NOS: 12, 14, 16, 18, 20, or 22. 

30 

14. A recombinant vector having the DNA of any of claims 1 0 to 1 3. 

15. A transformant having the DNA of any of claims 10 to 13 or the recombinant vector of claim 14. 

35 16. A fusion protein composed of the chromoprotein of any of claims 1 to 9 and another protein. 

17. A method for analyzing a physiologically active substance, which is characterized in that the FRET (fluorescence 
resonance energy transfer) method is applied using the chromoprotein of any of claims 1 to 9 as an acceptor protein. 

40 18. A light-absorbing reagent kit comprising the chromoprotein of any of claims 1 to 9, the DNA of any of claims 1 0 to 
13, the recombinant vector of claim 14, the transformant of claim 15, or the fusion protein of claim 16. 
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Fig. 3 
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Fig. 9 
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